
Conversation

@cubewise-gng
Contributor

To address #1307, write_dataframe_async now aggregates the dataframe before parallel processing.

@MariusWirtz
Collaborator

Would it be easier to call the existing aggregate_duplicate_intersections function directly from write_dataframe_async?

@cubewise-gng
Contributor Author

The dataframe may contain a mix of string and numeric values, and I did not want to aggregate the strings, so I avoided calling aggregate_duplicate_intersections directly.
Is that the right approach?

@MariusWirtz
Collaborator

> The dataframe may contain a mix of string and numeric values, and I did not want to aggregate the strings, so I avoided calling aggregate_duplicate_intersections directly.
> Is that the right approach?

Yes. Good catch. If increment is True, we should increment the numeric values and apply last-one-wins on string values.

Changing the increment default value to True is technically a breaking change, but I would argue that this counts as a bugfix.
With increment set to False (the current behaviour), the function produces nondeterministic results when handling duplicate numeric records.
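The split described above can be sketched in pandas. This is an illustrative stand-in, not the actual aggregate_duplicate_intersections implementation from TM1py; it assumes the last column of the frame holds the cell value and all preceding columns are dimension coordinates:

```python
import pandas as pd

def aggregate_duplicates(df: pd.DataFrame) -> pd.DataFrame:
    """Sum duplicate numeric records, keep the last string record per intersection."""
    key_cols = list(df.columns[:-1])
    value_col = df.columns[-1]

    # Rows whose value parses as a number are treated as numeric cells.
    numeric_mask = pd.to_numeric(df[value_col], errors="coerce").notna()

    # Increment semantics: sum duplicate numeric intersections.
    numeric = df[numeric_mask].copy()
    numeric[value_col] = pd.to_numeric(numeric[value_col])
    numeric = numeric.groupby(key_cols, as_index=False, sort=False)[value_col].sum()

    # Last-one-wins for string cells on the same intersection.
    strings = df[~numeric_mask].drop_duplicates(subset=key_cols, keep="last")

    return pd.concat([numeric, strings], ignore_index=True)
```

With a frame containing two numeric records and two string records on the same intersections, the numeric pair is summed while only the last string survives.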

I will run all tests and merge.

cellset = CaseAndSpaceInsensitiveTuplesDict(
dict(zip(df.iloc[:, :-1].itertuples(index=False, name=None), df.iloc[:, -1].values))
)
return cellset
Collaborator


This should return a dataframe.

Currently it returns a dictionary, which breaks existing code and existing test cases.
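Beyond the return-type mismatch, keying a plain dict on the coordinate tuples also collapses duplicate intersections silently (last key wins). A minimal illustration with a toy frame (the real code uses TM1py's CaseAndSpaceInsensitiveTuplesDict; a plain dict stands in for it here):

```python
import pandas as pd

# Two records on the same ("a", "x") intersection.
df = pd.DataFrame({"d1": ["a", "a"], "d2": ["x", "x"], "value": [1, 2]})

# dict(zip(...)) keys on the coordinate tuples; the duplicate
# intersection silently collapses to the last value seen.
cellset = dict(
    zip(df.iloc[:, :-1].itertuples(index=False, name=None), df.iloc[:, -1].values)
)
assert cellset == {("a", "x"): 2}  # the value 1 is lost
```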

Contributor Author


Thanks, made the correction.

@github-actions
Contributor

github-actions bot commented Dec 7, 2025

Tests completed for environment: tm1-11-cloud. Check artifacts for details.
